A New Arabic (AHD/AMSH) Handwritten Database

Authors

  • AMER AL-NASSIRI
  • SHUBAIR A. ABDULLA
Abstract

This paper introduces a new database of Arabic handwritten words. The Arabic handwritten database (AHD/AMSH) is a resource intended to facilitate experiments with character recognition algorithms. It contains three types of images: word, isolated character, and digit images. The AHD/AMSH can be used for baseline detection, character segmentation, normalization, and thinning, as well as for training and testing. The construction of the AHD/AMSH database was planned carefully to ensure its quality. The collection form comprised 150 words, 35 courtesy amounts, and 20 digits, and was filled in by 82 writers from 5 different age groups. This yielded 12300 words, 29028 sub-words, 56170 characters, 2870 courtesy amounts, 820 Indian digits, and 820 Arabic digits. After being divided into two categories, training and testing, the database was tested manually and systematically.

Keywords: Handwritten Arabic characters, Cursive writing, Database, Handwritten recognition
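The word, courtesy amount, and digit totals quoted in the abstract follow from multiplying the number of items on the form by the number of writers. The short Python sketch below checks that arithmetic. It assumes every writer completed every item, and it treats the Indian and Arabic digit totals as jointly accounting for the 20 digits per form; the abstract states neither point explicitly. The variable and function names are illustrative only. The sub-word and character counts depend on the chosen vocabulary and cannot be derived from the abstract, so they are not checked.

```python
# Figures quoted in the abstract.
WRITERS = 82
FORM_ITEMS = {"words": 150, "courtesy_amounts": 35, "digits": 20}

# Reported totals. Assumption: the 820 Indian and 820 Arabic digits
# together correspond to the 20 digits on each form.
REPORTED = {"words": 12300, "courtesy_amounts": 2870, "digits": 820 + 820}


def expected_totals(writers, form_items):
    """Items per form times number of writers, assuming complete forms."""
    return {name: per_form * writers for name, per_form in form_items.items()}


totals = expected_totals(WRITERS, FORM_ITEMS)

# Collect any category where the computed and reported totals disagree.
mismatches = {
    name: (totals[name], REPORTED[name])
    for name in totals
    if totals[name] != REPORTED[name]
}

for name, value in totals.items():
    print(f"{name}: expected {value}, reported {REPORTED[name]}")
print("consistent" if not mismatches else f"mismatches: {mismatches}")
```

Under these assumptions the three checked totals agree with the reported figures.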

Similar resources

Off-Line Arabic Handwritten Word Segmentation Using Rotational Invariant Segments Features

This paper describes a new segmentation algorithm for handwritten Arabic characters using Rotational Invariant Segments Features (RISF). The algorithm evaluates a large set of curved segments or strokes through the image of the input Arabic word or subword using a dynamic feature extraction technique, then nominates a small “optimal” subset of cuts for segmentation. All the directions of stroke ...

Off-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model

To facilitate the entry of data into the computer and its digitization, automatic recognition of printed texts and manuscripts is a considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...

HMM Based Approach for Handwritten Arabic Word Recognition Using the IFN/ENIT- Database

An offline recognition system for Arabic handwritten words is presented. The recognition system is based on a semi-continuous 1-dimensional HMM. Normalization parameters are estimated from each binary word image. First, height, length, and baseline skew are normalized; then features are collected using a sliding window approach. This paper presents these methods in more detail. Some parameters ...

Deep Learning Autoencoder Approach for Handwritten Arabic Digits Recognition

This paper presents a new unsupervised learning approach with a stacked autoencoder (SAE) for Arabic handwritten digit categorization. Recently, Arabic handwritten digit recognition has become an important area due to its applications in several fields. This work focuses on the recognition stage of handwritten Arabic digit recognition, which faces several challenges, including the unlimited vari...

Isolated Persian/Arabic handwriting characters: Derivative projection profile features, implemented on GPUs

For many years, researchers have studied high-accuracy methods for recognizing handwriting and have achieved many significant improvements. However, an issue that has rarely been studied is the speed of these methods. Given computer hardware limitations, these methods need to run at high speed. One of the methods to increase the processing speed is to use the computer pa...

Publication date: 2007